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ABSTRACT 


The authors have identified the following significant results. Preliminary 
dijgital analysis of ERTS MSS imagery reveals that the textural features of the imagery 
are very useful for land use classification. A procedure for extracting the textural 
features of ERTS imagery is presented and the results of a land use classification 
scheme based on the textural features are also presented. 

The land use classification algorithm using textural features was tested on a 
5100 square mile area covered by part of an ERTS-A MSS band 5 image over the 
California coastline. The image covering this area was blocked into 648 subimages 
of size 8.9 square miles each. Based on a color composite of the image set, a total 
of 7 land use categories v/ere identified. These land use categories are: Coastal 
forest, woodlands, annual grasslands, urban areas, large irrigated fields, small 
irrigated fields, and water. 

! The automatic classifier was trained to identify the land use categories using 
only the textural characteristics of the subimages; 75 per cent of the subimages were 
assigned correct identifications. Since texture and spectral features provide com- 
pletely different kinds of information, a significant increase in identification 
accuracy-w/ill take place when both features are used together. 
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PRELIMINARY REPORT ON LAND USE CLASSIFICATION USING TEXTURE 
INFORMATION IN ERTS-A MSS IMAGERY 


I. INTRODUCTION 


Two fundamental pattern elements used in human interpretation of pictorial 
data are texture and tone. The concept of tone is based upon varying shades of 
grey of resolution cells in the photographic image while texture is concerned with 
the spatial statistical distribution of grey tones. Much of the pattern recognition 
methods which have been used to process remotely sensed image data have been 
mainly concerned with temporal and spectral characteristics on a resolution cell 
level. Unfortunately such an approach does not take into account textural and 
context type information. Because the areal characteristics of texture and context 
carry so much information, it is important to develop appropriate pattern recognition 
features for them. 

We describe in this report a procedure for determining the textural features 
on ERTS imagery. The textural features we have developed are extracted from the 
matrix of relative frequencies P- j with which two neighboring resolution cells with 
a specific spatial relationship occur on the image, one with grey tone i and the other 
with grey tone j. Such matrices are calculated for each MSS image for different 
angular relationships between neighboring resolution cells as well as for different 
distances between them. From each of the different matrices, we extract features 
which contain information about the homogeneity of the image, linear dependencies 
in grey tone values, contrast and other image qualities and use these features for 
image classification. 

Using the textural feature extraction-classification algorithm we have 
developed, we processed part of an ERTS MSS image over the California coast. 

We divided a 5400 square mile area on the image into 648 subimages each covering 
an area of approximately 8.5 square miles. The ground truth for the land use types 
for 629 of these images were obtained through photo-interpretation. The textural 
features were computed for 629 of the 648 subimages. Each subimage was then 
classified as belonging to one of seven possible land use categories using part of the 
data for training and the remainder of the data for testing the classification accuracy. 
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We found that over 70 per cent- of the test samples can be correctly classified using 
the textural features of only one of the four MSS images. We conjecture that a much 
higher classification accuracy may be reached if the spectral information is used in 
addition to the textural features for all four MSS images in the set. 


II. TEXTURAL FEATURE EXTRACTION PROCEDURE 


Other than some work with Fourier, Fladamard, and the autocorrelation 
function (Rosenfeld and Troy, 1970; Kaizer, 1955) there exists little or no theory to 
aid in establishing what the textural features should consist of. Rather, the feature 
extraction operation is determined intuitively, rationalized heuristically, and later 
justified pragmatically and empirically. For automatic analysis of MSS and other 
imagery we have developed a procedure to extract textural features from the spatial 
grey tone dependence matrix which is computed for each image. 


II. 1 Spatial Grey Tone Dependence Matrix 

Let L = {1,2,... / N } and L = (1 ,2, ..., N ) be the x and y spatial 
x x y y 

domains and L xL be the set of resolution cells. Let G = ( 0, 1 , . . . , N } be the 
y x 9 

set of possible grey tones. Tben a digital image I is a function which assigns some 
grey tone to each and every resolution cell; hL^xL^-* G.* 

An essential component of our conceptual framework of texture is a matrix, 
or more precisely, four closely related matrices from which all texture-context 
features are derived. These matrices are termed angular nearest neighbor grey tone 
spatial dependence matrices. 

We assume that the texture-context information in an image I is contained 
in the over-all or "average" spatial relationship which the grey tones in image I 
have to one another. More specifically, we shall assume that this texture-context 
information Is adequately specified by the matrix of relative frequencies P.j with 


*The spatial domain L y xL x consists of ordered pairs whose components are row and 
column respectively/ This convention conforms with the usual two subscript 
row-column designation used in FORTRAN. 
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45 degrees 


0 degrees 


Figure 1. Resolution cells nos. 1 and 5 are the 0-degree (horizontal) nearest 

neighbors to resolution cell resolution celis nos. 2 and 6 are the 
135-degree nearest neighbors, resolution cells 3 and 7 are the 90- 
degree nearest neighbors, and resolution cells 4 and 8 are the 45- 
degree nearest neighbors to (Note thatthis information is purely 

spatial, and has nothing to do with grey tone values.) 


which two neighboring resolution cells separated by distance d occur on the image, 
one with grey tone i and the other with grey tone j (see Figure 1). Such matrices 
of spatial grey tone dependence frequencies are a function of the angular relation- 
ship between the neighboring resolution cells as well as a function of the distance 
between them. Figure 2 illustrates the set of all horizontal neighboring resolution 
cells separated by distance 1 . This set along with the image grey tones will be 
used to calculate a distance 1 horizontal spatial grey tone dependence matrix. For 
mally, for angles quantized to 45° intervals the unnormalized frequencies are 
defined by: 


P(i,j,d,0°) = # |((k,l), (m,n)) e (L y xL x )x(L y xL x )| k-m=0, |l~n|=d, l(k,l)=i, l(m,n)=i| 
P(i, j,d,45°)= # j((k,l),(m,n))e (L y x )x (L y x L x ) J (k-m=d, l-n= -d) or (k-m= -d, l-n=d), 

l(k,l)=i, l(m,n)=j } 

P(i,i,d,90°)= # {((k,l),(m,n))e (L y xL x )x(L y xL x )| |k-mj =d, l-n=0, l(k, I >=i , l(m,n)=j} 
P(i,j,d, 135°)= ((k,l),(m,n))e (L xL )x(L xL )| (k-m=d, l-n=d) or (k-m= -d, l-n= -d), 

y x y a 1 

i(k,l) = i, l(m,n) = j } 
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L y = |l,2,3,4| 

L x = |1, 2,3,4 


0,1) 

(1,2) 

(1,3) 

(1,4) 

(2,1) 

(2,2) 

(2,3) 

(2,4) 

(3,1) 

(3,2) 

(3,3) 

(3,4) 

(4,D 

(4,2) 

(4,3) 

(4,4) 


= {((k,D,(m,n))e (L y xL x ) x(L,xl x ) |k-m = 0, | i-n I = lj 

= |((V,1), (1,2)), ((1,2), (1,1)), ((1,2), (1,3)), (0,3), (1,2)), 

(0,3), 0,4)), (0,4), (1,3)), ((2,1), (2,2)), ((2,2), (2,1)), 
((2,2), (2,3)), ((2,3), (2,2)), ((2,3), (2,4)), ((2,4), (2,3)), 

((3,1), (3,2)), ((3,2), (3,1)), ((3,2), (3,3)), ((3,3), (3,2)), 

((3,3), (3,4)), ((3,4), (3,3)), ((4.1), (4,2)), ((4,2), (4,1)), 

((4,2), (4,3)), ((4,3), (4,2)), ((4,3), (4,4)), ((4,4), (4,3))} 


Figure 2 illustrates the set of all distance 1 horizontal neighboring 
resolution cells on a 4 by 4 image. 
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Note that these matrices are symmetric; P(i,j; d,a) = P(j,i; d,a). The 
distance metric P implicit in the above equations can be explicitly defined by 
P((k,l), (m,n)) = max | ik-m| , |l _ n|}. 

Consider Figure 3-a, which represents a 4 x 4 image with four grey tones, 
ranging from 0 to 3. Figure 3-b shows the general form of any grey tone spatial 
dependence matrix. For example, the element in the (2,l)~st position of the 
distance 1 horizontal P^ matrix is the total number of times two grey tones of value 
2 and 1 occurred horizontally adjacent to each other. To determine this number, 
we count the number of pairs of resolution cells in R such that the first resolution 
cell of the pair has grey tone 2 and the second resolution cell of the pair has grey 
tone 1. In Figures 3 - c through 3-f we calculate all four distance 1 grey tone 
spatial dependence matrices. 
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any grey tone spatial dependence 
matrix for an image with integer 
grey tone values 0 to 3. #(i, j) 
stands for number of times grey 
tones i and j have been neighbors. 


Figure 3-a. 


Figure 3-b. 
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Figure 3-f. 


Hie appropriate frequency normalization for these matrices can be easily computed. 
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II. 2 Textural Features 


From each of the grey lone dependency matrices we extract a set of 32 
textural features. The equations which define these features are given in Appendix 
A of this report. For illustrative purposes we define three of these features and 
explain their significance. 


N N 



N N 

yf 'i PQ.i) . 


m y 

x p y 


°x°y 


where 

#R = number of resolution cells pairs, and p and a are the mean and 

X X 

standard deviation of the marginal distribution defined by 



and u and a are the mean and standard deviation of the marginal distribution P 

y y : 

defined by 


N 



# R 
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a. Grassland 
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Contrast 

Correlation 

0° 
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135° 
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.4610 

Avg. 

.0087 

3.945 

.6259 


Figure 4. Texfural Features for 1 
Images. 
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b. Water Body 
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Different Land Use Category 






To explain the significance of these features, let us consider the kind of values they 
take on two different land use category images. Figure 4 shows the digital printout 
of two sub-images of size 64 x 64 resolution cells (approximately 8.5 square mile 
area) from MSS band 2 of the California frame. The image shown in 4(a) belongs 
to the grass land category and image shown in Figure 4(b) is mostly water. Values 
of the features f^ , an ^ *3 are a ^ so s ^ own f°r these images in Figure 4. 

The angular second moment features (ASM), fj, is a measure of homogeneity 
of the image. In a homogeneous image, such as shown in 4(b), there are very few 
dominant grey tone transitions. Hence, the P matrix for this image will have fewer 
entries of large magnitude. For an image like the one shown in Figure 4(a), the 
P matrix will have a large number of small entries and hence the ASM feature which 
is the sum of squares of the entries in the P matrix will be smaller. A comparison of 
the ASM values given below the images in Figure 4 shows the usefulness of the ASM 
feature as a measure of the homogeneity of the image. 

The contrast features, ' s obtained as a difference moment of the P matrix 
and is a measure of the contrast or the amount of boundaries present in an image. 
Since there is a large amount of boundaries present in the image 4(a) compared to 
the image shov/n in 4(b), the contrast feature for the grassland image has consistently 
higher values compared to the water body image. 

The correlation feature, fg, is a measure of linear grey tone dependencies 
in the image. For both the images shown in Figure 4, the correlation features is 
somewhat higher in the horizontal (0°) direction, along the line of scan. The water 
body image consists mostly of a constant grey tone value for the water plus some 
additive noise. Since the noise samples are mostly uncorrelated, the correlation 
features for the water body image have lower correlation values compared to the 
grassland image. Also the grassland image has a considerable amount of linear 
structure along 45° lines across the Image and hence the value of the correlation 
feature is higher along this direction compared to the values for 90° and 135° 
directions. 

The various features which we suggest are all functions of distance and 
angle. The angular dependencies present a special problem. Suppose image A 
has features a, b, c, d for angles 0°, 45°, 90°, and 135° and image B is identical 
to A except that B is rotated 90° with respect to A. Then B will have features 
c, d, a, b for angles 0°, 45°, 90°, and 135° respectively. Since the texture 
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context of A is the same as the texture context of B, any decision rule using the 
angular features a, b, c, d must produce the same results for c, d, a, b or for 
that matter b, c, d, a (45° rotation) and d, a, b, c (135° rotation). To guarantee 
this, we do not use the angularly dependent features directly. Instead, we use 
two symmetric functions of a, b, c, d, their average, and their range. These 
features can be represented in vector form F = [f j , f 2 , • • •/ f n l / where f j , f 2 , • • /^ n 
are the values of the features, T denotes the transpose. 

The textural features used in our study were computed for four angles and 
for a distance of one. 


HI. IDENTIFICATION PROCEDURE 
III . I Introduction 

The problem of developing procedures for categorizing environmental units 
consists of the following. 

With reference to Figure 5, the universe U consists of environmental units 
(for example rocks) U 1 ,U 2 , ... ,Uy which belong to one of R possible categories 
Cj,C 2 ,...,C^ (different land use categories). Of the large number of environ- 
mental units present in the universe, we observe a smaller subset of units 

,U 2 / . . .,U|^. Our observations consist of a set of measured values of n features 
fj,f 2 /.../f n for each unit U sampled. Based on the information contained in the 
feature vectors F|,F 2 ,...,F^, the categories of the environmental units which 
produce these measurements being known, we want to develop an algorithm to 
identify the categories of new units based on the measurements they produce. 
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Figure 6. IdenHficaMon Scheme. 






The decision rule which assigns categories based on the values of features 
may be implemented in the feature space F by partitioning F into various regions 
and assigning categories to new units based on the regions to which their feature 
vectors belong. Efficient partitioning of the feature space may require complicated 
nonlinear decision boundaries (discriminant functions). Instead of deriving a deci- 
sion rule in the feature space F, we may transform the feature vectors into a new 
space X and implement a decision rule in the new space X. By using appropriate 
nonlinear transformations, we may be able to implement nonlinear decision boun- 
daries in F as linear decision boundaries in X. Several procedures are available for 
deriving linear decision boundaries for partitioning X into various regions, based on 
the information contained in a set of sample patterns Xj,X 2 ,...,X^ whose cate- 
gories are known . 


Identification Algorithms : In a widely used algorithm (Fukunaga 1972, Fu and 
Mendel 1970, Miesel 1972), the pattern space X is separated into a number of 
regions using a set of hyperplanes (decision boundaries) whose locations are deter- 
mined by the sample patterns. Each region is dominated by sample patterns of a 
particular category. When a new pattern is presented for identification, it is 
assigned a category depending on the region in which it belongs. If the new pattern 
X is located in a region dominated by sample patterns of category c., then X is 
classified as coming from category c.. 

To illustrate the procedure for obtaining the hyperplanes, consider the 

problem of separating the sample patterns '^2' ’ ' ' belonging to category c. 

and X , X , ...,X belonging to category c. . We can write the 

linear discriminant function (hy'perp^ane) which separates the patterns belonging to 

categories c. and c. as 
1 J 


h.. (X) - V.. X + v.. > 0 for Xec ■ , 
ij ij ij i 

hjj (X) = V.j X + v.° < 0 for Xecj 


(3) 


The vector V.. and the scalar v.j are to be determined from the information contained 
on the sample patterns. 

If we introduce a new form to express the pattern vectors as 


z 

z 


( +1 Xj x 2 

[-1 -> 


.. x ]' 
n 


for Xec. 


-x 0 ... -x ] 
2 n 


for Xec. 
J 


(4) 
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then the discriminant function can be written as 


h.. (Z) = W..Z > 0 

•J 'J 


(5) 


where W.. is referred to as a weiqht vector and 

•J 

h.. (Z) = W.Jz = 0 (6) 

•J 'J 

is the equation of a hyperplane in the transformed feature space. 

The weight vector W.. is chosen so as to satisfy equation 5 for as many 

training patterns as possible. Usually we do not know the precise form of h... But, 

given our knowledge of the categories of the training patterns, we can postulate 

reasonable values a.. (Z, ) for h.. (Z, ) and choose W.. to minimize the mean square 
°ij k ij k ij 

error given by 


r\ 1 n. + n. T r\ 

e z = — 1— ' I J (W.. Z, - g.. (Z, ) T 

n i n j k=l ,J k ,J k 


(7) 


Usually g.j (Z^) is taken to be +1 for k = 1 ,2, . . . ,n. + n. . We can rewrite e as, 

1 e 2 = , I - y [ W.Jy - G.J ] [ Y T W.. - I ] (8) 

(n. + n.) ij ij ij 


where 


Y = l z, Z„ ... Z ], and 

I jl n. + n. 

i 


j 


V'W g ii (Z 2)---Sij( Z n ! + n.) 


The weight vector which minimizes f given in equation 8 is given by 


W.. = (YY T f 1 YG.. 

ij ' ' ij 


(9) 


which is the well-known normal equation set from linear least square theory. 

For the multicategory problem involving N D categories, a total of N D (N D ~l)/2 

K K K 

hyperplanes must be determined using the procedure described above. After the 
hyperplanes are determined, the classification of new patterns is done as follows. 

For each category c., the number of hyperplanes, V., which give a positive response 
when the new pattern X is presented are determined using 

Nr, 


V. - L 

' j=l 
j* 


|W.Jz|+ W. T z ; i = 1,2, ...Np 

U LL_ R 

2 |W.. T Z| 

•J 


00 ) 
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where 


z = [x'- 

X Is assigned to category c. if 
V. = max f V. } 

J i 

If there is a tie between categories c m and c n , then X is assigned to c m if W mr) Z 

or to c if W < 0. Several modifications of the linear discriminant function 
n mn 

method and a multitude of other classification procedures may be found in the 
references cited. 



IV. CLASSIFICATION EXPERIMENTS 


IV. 1 MSS Data Used 

All of the data used in our present study were derived from the MSS 5 
band of ERTS-A image frame 1002-18134. The date of the flight was July 25, 1972, 
and the center coordinates of the frame were 37.291 N, 120.935W. The area of 
coverage includes the San Francisco Urban area and the Monterray Bay on the coast 
line of California. Of the four image strips in this frame, all of strip one and one- 
half of strip three were digitally processed. The second MSS band image and the 
area processed are shown in Figure 6. 

As a first step in the digital processing, the image was divided into a total 
of 648 sub-images of size 64 x 64 resolution cells (each covering a ground area of 
8.5 square miles). For each of the sub-images, the digital spectral data from 
sensor band five (MSS band two) were extracted. The imagery data for each sub- 
image was then normalized using an equal probability quantizing algorithm and the 
32 textural features for each sub-image were then computed. 

Ground truth for each of the 648 sub-images were obtained with the help of 
photointerpreters working on the MSS images and the color composite image. A 
total of seven land use categories could be identified reasonably well and these 
categories are the following. 

IV. 2 Categories of Land Use Detectable on ERTS Image (1002-18134) 

(1) Coastal Forest 

This area can be typified as a dense temperate rainforest sited on the wind- 
ward side of the Coast Ranges of California. It is composed of needleleaf and 
broadleaf evergreen trees which generally form a continuous cover. 

(2) Woodlands 

These are areas to the leeward side of the Coast Ranges. At higher eleva- 
tions, these woodlands consist of oaks, mostly evergreen, with varying but less 
continuous coyer. At lower elevations these oak woods grade into chapparal which 
provides decreasing cover with decreasing elevation until it gives way to annual 
grasslands. 
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Figure 6. 


Two strips whose boundories ore shown on the top 
were processed from this MSS image set (1 G02-1 81 34) . 
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(3) Annual Grasslands 

This is an area of non-native annual grasses which have already completed 
their life cycle by the end of July (turned brown). This area is primarily, like the 
preceding two categories, quite mountainous. This is also the natural vegetation 
which would be found in the Great Valley of California except for anthropogenic 
vegetation. 

(4) Urban 

Centers of human activity are extremely important as a landscape component 
within this frame they are none randomly located in a size hierarchy of central 
places connected by transport links. The primary urban agglomerations within the 
frame are the San Jose-Oakland segment of San Francisco, and parts of the Sacramenta 
Stockton urban complex. Unlike the preceding categories one of the main image 
features of urban areas is non-uniformity. Both pattern and texture, as well as a 
variety of, tones, are associated v/ith urban complexes. 

(5) Large Irrigated Fields 

1 For practical purposes all field agriculture in the Great Valley is based on 
irrigation . The G reas ccsicjns d to this category would most likely contain field crops 
such as cotton, alfalfa, and other crops readily adaptable to high mechanization. 

Most tree crops and vineyards would also be in this category. 

(6) Small Irrigated Fields 

These areas would contain high value low mechanization crops typified by 
vegetables. 

(7) Water 

Although almost self-explanatory as a term, the category in this instance is 
used to include: (1) ocean, (b) lakes - natural and man-made, and (c) standing 
water (in fields and on flood plains). 

Out of the 648 sub-images in the frame, the photo-interpreters helped us to 
find a unique land use ground truth category for a total of 629 sub-images. Due to 
cloud cover and other ambiguities the ground truth for 19 sub-images could not be 
positively identified. 
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IV. 3 Results of Classification Experiments 

The textural feature vectors for each of the 629 sub-images, for which ground 
truth was available, were divided arbitrarily into training and test sets. The classi- 
fication algorithm was developed using the information contained in the training set 
and the samples in both the training and test set were assigned to one of the seven 
possible land use categories. The accuracy of classification was obtained by com- 
paring the category assigned by the classifier v/ith the ground truth category and 
the results are presented in the following paragraphs. 

Experiment No. 1 

The data set used in this experiment consisted of 416 sub-image samples 
obtained from strip 1 of the large image. There were six land use categories which 
could be identified on this image strip. These categories are: coastal forest; wood- 
lands; annual grasslands; urban areas; water bodies; and large irrigated fields. Two 
classification tests were run on this data set. In the first classification run, 80 per 
cent or 334 of the 416 samples were used for training the classifier and the remaining 
82 samples were used to test the accuracy of the classifier and up to 74.5 per cent 
of the test samples were classified correctly. The contingency table for this 
classification is given in Table 1 . 

In the second classification run, 70 per cent or 293 of the 416 samples were 
used for training. The accuracy of the classifier on the remaining 123 samples was 
found to be 68.5 per cent. 

Experiment No. 2 

All of the 629 sub-image samples for which the ground truth category can 
be identified were used in this experiment. There were seven ground truth categories, 
including the six categories used in the previous experiment and a small irrigated 
field category. Tests of the classifier were conducted using 80 per cent, 70 per 
cent, and 50 per cent of the samples respectively for training and the remaining 
samples for testing the classifier. A summary of the recognition accuracy of the 
classifier is given in Table 2. 
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TABLE 1 


CONTINGENCY TABLE FOR LAND USE CLASSIFICATION 


Assigned 

•v^Gategory 

Tnje'' s C^ 

Category^* 

Coastal 

Forest 

Woodlands 

Annual 

Grass- 

lands 

Urban 

Area 

Large 

Irrigated 

Fields 

Small * 

Irrigated 

Fields 

Water 

Total 









Coastal 

Forest 

9 

1 

1 

0 

1 

0 

0 

12 

Wood- 

lands 

1 

7 

0 

0 

2 

0 

0 

10 

Annual 

Grass- 

lands 

0 

2 

15 

2 

1 

0 

0 

20 

Urban 

Area 

0 

2 

4 

4 

0 

0 

0 

10 

Large 

Irrigated 

Fields 

0 

0 

3 

0 

15 

0 

0 

18 

Small 

Irrigated 

Fields 

0 

0 

0 

0 

0 

0 

0 

0 

Water 

0 

0 

1 

0 

0 

0 

11 

12 

Total 

10 

12 

24 

6 

19 

0 

11 

82 


Number of Training Samples = 334 Number of Tesf Samples - 82 


Accuracy of Classification on Training Set - 86 per cent 
Accuracy of Classification on Test Set = 74.5 per cent 


*There were no samples from small irrigated fields category in this data set. 
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TABLE 2 

SUMMARY OF PERFORMANCE FOR VARIOUS TRAINING SET SIZES 


Percent of Samples 
Used for Training 

80% 

70% 

50% 

Accuracy of the 
Classifier on the 
Training Samples 

78% 

80.5% 

90.5% 

Accuracy of the 
Classifier on the 
Test Samples 

72.5% 

70.5% 

67.5% 


Details of classification for various land use categories for the 70 per cent training 
set are given in Table 4. 
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TABLE 3 


CONTINGENCY TABLE FOR LAND USE CLASSIFICATION OF TEST SAMPLES 


Assigned 

^Category 

True\^ 

Category\^ 

Coastal 

Forest 

s 

Woodlands 

Annual 

Grass- 

Lands 

Urban 

Area 

Large 

Irrigated 

Fields 

Small 

Irrigated 

Fields 

Water 

Total 

Coastal 

Forest 

11 

1 

3 

0 

2 

0 

0 

17 

Woodlands 

1 

9 

3 

0 

3 

0 

0 

16 

Annual 

Grasslands 

0 

2 

61 

3 

5 

0 

1 

72 

Urban 

Area 

1 

1 

8 

4 

0 

1 

0 

15 

Largii 

Irrigated 

Fields 

0 

0 

5 

0 

18 

3 

1 

27 

Small 

Irrigated 

Fields 

0 

0 

10 

0 

1 

10 

0 

21 

Water 

0 

0 

0 

0 

1 

0 

17 

18 

Total 

13 

13 

90 

7 

30 

14 

19 

186 


Number of Training Samples = 443 Number of Test Samples = 186 

Accuracy of Classification on Training Set = 80.5 per cent 
Accuracy of Classification on Test Set = 70.5 per cent 
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V. DISCUSSION 


The overall accuracy of the automatic land use classification procedure 
based on the textural features of ERTS-A imagery is about 70 per cent. A sub- 
stantial portion of the error in the classification procedure results from the classifier's 
inability to distinguish between the woodlands area and the annual grasslands, and 
the inability to correctly identify the urban areas. The difference in the textural 
characteristics of the woodlands area and the annual grassland is rather subtle since 
at lower elevations of the mountainous region the woodlands have decreasing cover 
until they give way to annual grasslands. The inability to correctly identify urban 
areas arise from the fact that urban areas have more than one type of textural ly 
homogeneous areas and some of these areas look very much like other land use 
categories. 

The accuracy of the classifier will be considerably improved if: 

(1) The textural features from all 4 MSS bands are used; 

(2) Some spectral characteristics are used as additional inputs to the 
1 classifier. 

Experiments are currently underway to: 

(1) Obtain the comparative accuracy of a land use classification using 
spectral characteristics of the MSS data; 

(2) Use spectral characteristics as additional inputs to the classifier. 

With a scheme using spectral and textural features we expect a significant 

improvement in the accuracy of the land use classification scheme. Further improve- 
ment in the accuracy of the classifier is possible if the sub-image samples are 
classified utilizing the probabilities of various categories occurring spatially adja- 
cent to each other in the image. For example in the California frame, urban areas 
do not occur adjacent to large water bodies or irrigated fields. However, since the 
classifier assigns each sub-image to a land use category independent of the land use 
categories of the surrounding sub-images this context information is not utilized by 
the classifier. A compound decision rule can be developed to take into account 
this additional information and will lead to higher accuracy of classification. 
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APPENDIX A 


TEXTURAL FEATURES OBTAINED FROM THE GREY TONE DEPENDENCE MATRIX 


In this appendix, we define 16 textural features which are computed for each 
of the four angular grey tone dependence matrices. For each feature, we use the 
mean and range of the four values for the four angles, thus a total of 32 variables 
which are used as inputs to the classifier. 

The following notation will be used in defining the 16 textural features. 

iL 

P(i,j) - (i,j) n entry in a particular grey tone dependence matrix. 




- i^ entry in the marginal distributions of P(i,j) obtained by 
summing rows and columns of P (i , j) respectively. 


#R - number of resolution cell pairs which were considered in computing 
the entries in P(i,j). 

N - number of distinct grey tone values in the image. 

9 

y - mean of P(i,j)/^R- 


(i)-i^ 1 entry in the distribution of the sum of grey tones of neighboring 
^R resolution cells. 



0) 


th 

i entry in the distribution of the absolute differences in the grey 
tones of neighboring resolution cells. 


k 
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TEXTURAL FEATURES 


1. Angular Second Moment: 

N N 

f rSE lM 


i=l j=l 


# R 


2. Entropy: 


f 2' 


I t-m- (v 


3.‘ Correlation: 

1 N N 

. XT' ij 1 Eil/Jl ( - i, ,, 

L L IJ # R | x y 

u - 


CT x'°y 


where u and a are the mean and standard deviation of P . and 
x x x 

and a are the mean and standard deviation of P . 

My y y 


4. Sum of Squares on x: 

i=l j=l ' 


5. Product Moment: 

N N 

g g 


f 5 = ZZ 
i=i j=i 


PQ/j) 

#R 



6. Inverse Moment - : 


N N 

f ,=y y _ !_ 

6 M , + H> 2 ' #R > 


7. Difference Moment: 


N N 

g g 


, .\2 


f 7 = Z) H) 


i=l i=l 


p M l 

#R i 


8. Sum Average: 


2N 



f 

g 


= variance of P , 
x + 



10. Sum Entropy: 



1 1 . Contrast: 
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12. Difference Variance: 

P 

, _ r x-y 

f 10 - variance or — 

# R 

13-. Difference Entropy: 



14, 15, 16. Additional Measures of Correlation : 

f - HXY - HXY1 
14 max HX,HY 


f ]5 = \/l " exp[-2.0(HXY2-HXY)3 


f 1 6 = -^econd largest eigenvalue of QQ^ 


where HX and HY are the entropies of the marginals of the transition matrix 
before quantization, HXY is the entropy of the transition matrix, and 
HXY2 is the entropy of the product distribution of the rnarginals before 
quantization; 

Q(i,j) = P( i .j)/V / P x (!>P y O> 


V s 


the maximal correlation coefficient. 
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